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; ABSTRACT 



CNI ■ We perform a detailed analysis of the optical gravitational lens ER 0047-2808 

imaged with WFPC2 on the Hubble Space Telescope. Using software specifically de- 
signed for the analysis of resolved gravitational lens systems, we focus on how the 
image alone can constrain the mass distribution in the lens galaxy. We find the data 
are of sufficient quality to strongly constrain the lens model with no a priori assump- 
tions about the source. Using a variety of mass models, we find statistically acceptable 
results for elliptical isothermal- like models with an Einstein radius of 1.17" An ellip- 
f*^S • tical power-law model (E oc Rr^) for the surface mass density favours a slope slightly 

steeper than isothermal with (3 = 1.08±0.03. Other models including a constant M/L, 
pure NFW halo and (surprisingly) an isothermal sphere with external shear are ruled 
out by the data. We find the galaxy light profile can only be fit with a Sersic plus point 
source model. The resulting total M/Lb contained within the images is 4.7/i65 ± 0.3. 
In addition, we find the luminous matter is aligned with the total mass distribution 
within a few degrees. This is the first time a resolved optical gravitational lens image 
has been quantitatively reproduced using a non-parametric source. 

The source, reconstructed by the software, is revealed to have two bright regions, 
with an unresolved component inside the caustic and a resolved component straddling 
r> | a fold caustic. The angular size of the entire source is ~ 0.1"and its (unlensed) Lyman- 

^ ■ a flux is 3 x 10~ 17 erg s _1 cm~ 2 . 

Key words: Gravitational Lensing — galaxies: individual (0047-2808) — galaxies: 
high rcdshift 
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1 INTRODUCTION 

The discovery jHewitt et all Il988l: ILangston et al. Il98£ ) 
and subsequent modelling iKochanel^^Naravanr 1992c ; 
iKochanekl ll995Tl of Einstein ring gravitational lenses at 
radio wavelengths raised the prospect of obtaining de- 
tailed quantitative information for both the lensed sources 
and the form of the mass distr i bution in the interven- 
ing lensing galaxies. iKochaneld (|1995f) compared mass 
models for t he lensed radio lobe MG1 654+134 using 
LensCLEAN jKochanek fc Naravanll992bJ) m image space. 
It was subsequently shown that LENSCLEAN-ing radio 
data in image space can introduce biases, hence visibility- 
based LensCLEAN was introduced l|Ellithorpe et alJll996t 
IWucknitzlfeoiMl) . IWucknitz et alJ J2004I) recently used this 
method to measure Ho in B0218+357. 

Recent work at optical/near-IR wavelengths 

llKochanek et alJ l200d. l200lf) has concentrated on what 



can be learnt using in fr ared images of Einstein rings. 
iMiralda-Escude fc Leharl dl992f) pointed out that there 
should exist a much larger number of Einstein rings 
detectable at optical wavelengths, an d systematic survey s 
for such systems are now underway dHewett et alj hoOCO. 
ER 0047-2808, the first known example of a galaxy lensing 
another normal galaxy, was the first Einstein ring to be 
discovered at optical wavelengths iWarren et alj ll996). The 
ER 0047-2808 system, together with the recently discovered 
lens 1RXS J113155.4-123155 iSluse et alJ l2003Tl . are the 
only Einstein ring systems with good S/N Hubble Space 
Telescope (HST) images at optical wavelengths. Modelling 
of the system, exploiting the extended surface brightness 
distribution of the lensed source, would provide details 
of the source morphology and allow some discrimination 
between models for the mass distribution in the intervening 
lensing galaxy. Compilation of a large sample of optical 
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Einstein rings could provide information on the luminosity 
function and morphologies of star-forming galaxies at 
redshifts z — 3 — 5 whose (unlensed) magnitudes remain 
too faint for direct study using even 8m-class telescopes. 
Similarly, statistical properties derived from a large sample 
of such lensed systems, will provide new information on 
the still poorly constrained distribution of mass in massive 
early-type galaxies, which form the majority of the deflector 
population. 

As a first step toward achieving these goals, this pa- 
per presents an analysis of the HST 1 imaging observa- 
tions of ER 0047- 2 808. This system has been studied by 
iKoopmans fc Treul <l2003t hereafter 'KT03'), who combined 
dynamical information (the stellar velocity dispersion pro- 
file), with gravitational lens information (the angular size of 
the Einstein ring), to draw conclusions on the mass profiles 
of both the visible and dark matter. The HST image was 
used simply to determine the total mass within the ring, 
and to measure the lens galaxy light profile. In our com- 
plementary analysis, we concentrate on extracting as much 
information on the mass profile as possible from the image 
alone. We use software based on the LensMEM method 
JWallineton et alJll99d hereafter 'WKN96') to perform a 
gravitational-lens inversion of the complete surface bright- 
ness distribution around the ring. This allows us to draw 
conclusions on the radial distribution of the (total) mass, 
and also produces new insight into the source morphology. 
An advantage of the optical CCD image over radio images 
is that the counts in the pixels are independent, so that 
an unambiguous goodness-of-fit statistic may be measured. 
To analyse a radio image properly requires the covariance 
matrix for the image pixels, or to work directly with the 
visibilities dEllithorpe et alJll996t IWucknitzlEooll . 

Sectional describes the HST WFPC2 observations, the 
data reduction steps, the determination of the noise char- 
acteristics of the final image of ER 0047-2808, and the ac- 
curate measurement of the surface brightness profile of the 
lensing galaxy. Section [3] presents details of the lens mod- 
elling algorithm. Seven different mass models for the lens are 
considered. Four of the mass models provide satisfactory fits 
to the data, whereas three are rejected. A discussion of the 
results, and comparison with the analysis of KT03, is given 
in Section 4, before the principal conclusions are summarised 
in the final section. 

Throughout this paper, we assume a ACDM cosmology 
with Q A = 0.7, tt m = 0.3 and H = 65 kms^ 1 Mpc" 1 (for 
consistency with KT03) unless otherwise stated. 



2 OBSERVATIONS AND DATA REDUCTION 

2.1 Observations and frame combination 

ER 0047-2808 was observed with the WFPC2 instrument 
on HST over four orbits, on 1999 January 7. The target 
was placed on chip 4 (pixel size 0.1") of the WFPC2 and 

1 Based on observations made with the NASA/ESA Hubble 
Space Telescope, obtained at the Space Telescope Science Insti- 
tute, which is operated by the Association of Universities for Re- 
search in Astronomy, Inc., under NASA contract NAS 5-26555. 
These observations are associated with program # 6560 



exposures were made using the F555W filter, which contains 
the strong Lya emission line, at z = 3.67, of the lensed 
source. A dither step was applied between orbits, on a 2 x 
2 grid with step size N + 0.5 pixels, in order to improve 
the sampling. Two equal exposures of 1200 s were made per 
orbit, with the exception that the first exposure of the first 
orbit was only 1100 s due to the field acquisition overhead. 
The data were processed through the WFPC2 pipeline using 
the latest calibration frames. 

The first step in combining the frames was to subtract 
the average counts in the backg round from each frame. We 
then used the dither software iFruchter &: Hookl I2002T) to 
identify and eliminate cosmic rays and then to interlace the 
data into the half-pixel grid. In dither parlance, interlac- 
ing corresponds to using a delta-function drizzle footprint, 
pixfrac—0.0. This was done to ensure that the data in each 
pixel remain independent. For the same reason we did not 
apply the distortion-coefficient corrections to linearise the 
astrometry of the field. The lack of accurate astrometry over 
the field is not a concern since we are only interested in the 
immediate vicinity of the lensing galaxy. 

Cosmic-rays pose a problem since there are only two ex- 
posures per pixel. To identify cosmic-rays we firstly drizzled 
the data using a fat footprint, pixfrac=1.0, to form a slightly 
smoothed combined image. In this way information from ad- 
jacent pixels contributes to the estimate of the counts in a 
particular interlaced pixel. The combined image can then be 
used to identify where the counts in any individual exposure 
are abnormally high. Specifically, the routine driz-cr com- 
pares the drizzled image against the eight individual expo- 
sures to identify cosmic rays in each exposure. The affected 
pixels were then masked, an improved average frame was 
formed, and further cosmic-rays were identified. In the re- 
gion around the galaxy we checked carefully the cosmic-ray 
identifications of driz_cr in order to ensure the u-rejection 
level was set at a value close to optimal. The driz_cr routine 
proved to be highly effective. Once the cosmic-ray masks 
for the eight individual exposures had been defined the data 
were interlaced, averaging the data pairs, after scaling all 
frames to a common exposure time of 1187.5 s. 

The final combined frame is shown in Fig.[I\(LHS). The 
pixels are shown with side 0.05", i.e. half the original pixel 
size, since this is the interlaced pixel separation. However, it 
should be understood that this is for illustration only. Since 
the data have been interlaced, the original pixel size, i.e. 
0.1", has been retained. In other words, Fig. illustrates 
four frames simultaneously, each offset from each other on a 
2x2 grid of side 0.05". In all the fits described in this paper 
we account for this data format. This is necessary because 
of the undersampling of the WFPC2 point spread function 
(PSF). 

The image contains a large number of useful pixels for 
measuring the surface brightness distribution of the lens 
galaxy (~ 10 4 ) and the Einstein ring (~ 10 3 ). In such cases 
it is essential to establish the accuracy to which the uncer- 
tainty of the counts in each pixel is known. This is because 
the corresponding uncertainty to which \ 2 mav be deter- 
mined can be as large, or larger, than the width of the x 2 dis- 
tribution itself. For example, if the number of degrees of free- 
dom in the model fitting (~ no. pixels) is ~ 10 4 , the reduced 
X 2 is normally distributed as xl = l-0± \fW" = 1.0±0.014. 
For this case, in order that the uncertainty in x 2 is not dom- 
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inated by the uncertainty in the determination of the noise 
in each pixel, it would be necessary to determine the noise 
to a fractional accuracy substantially better than 0.007. 

A Poisson estimate of the la noise-frame was com- 
puted, incorporating read and photon noise, and accounting 
for elimination of data due to cosmic-rays, i.e. whether 0, 1, 
or 2 exposures contribute to an individual pixel. Standard 
WFPC2 values of the read-noise, 5e~, and gain, 7e~ / ADU , 
were adopted. To check the accuracy of the estimated uncer- 
tainties, the final combined data frame was divided by the 
la noise-frame to form a signal-to-noise ratio (S/N) frame. 
The standard deviation of the counts in the background, 
which should be unity if the uncertainties are correct, was 
found to be 1.06 ± 0.01, where the quoted uncertainty in 
this value was established from the scatter in measurements 
from several areas around the frame. It is unclear whether 
the small underestimate of the noise is due to lack of preci- 
sion in the value of the read noise or the value of the gain. 
Therefore, the la noise-frame was simply scaled by a factor 
1.06. The accuracy with which this scaling can be measured, 
1%, means that the uncertainty in measured values of \ 2 is 
2%. 

2.2 Galaxy surface brightness profile 

To subtract the image of the galaxy, necessary to isolate 
the image of the lensed source, we fitted parameterised 2- 
D models of the galaxy surface brightness profile convolved 
with the point spread function, us ing a modification of our 
own software l| Warren et alJ l2001'l appropriate for drizzled 
data. The goodness-of-fit was calculated by summing x 2 
over the four separate interlaced frames. To prevent the 
presence of the lensed background source from affecting the 
model fit to the galaxy, we masked out pixels in the vicin- 
ity of the Einstein ring. This was achieved in an iterative 
fashion, firstly subtracting the galaxy fit, then smoothing 
and thresholding the residuals to create a first mask, then 
refitting, and then refining the mask, etc. The final mask 
contains 842 pixels and is shown in Fig. Q (centre). 

Fitting the surface brightness profiles of early-type 
galaxies, which are cuspy at the centre, requires care, par- 
ticularly where the data are undersampled. To generate a 
model image for each interlaced frame, the model galaxy 
profile was firstly integrated over sub-pixels of size .1" /3. 
The PSF w as computed using the tinytim software (|KristJ 
Il993l . Il995f ). integrated over the same sub-pixel size. The 
model image was then convolved with the model PSF. The 
HST PSF is undersampled by WFPC2 at the wavelength 
of our observations and the sub-pixelation is necessary to 
correctly reproduce the actual image that would be seen by 
the detector. After rebinning this convolved image to the 
full 0.1" pixel size, the image was further convolved with 
the WFPC2 pixel scattering function, as the final step in 
forming a model image. The pixel scattering function ac- 
counts for the diffusion of electrons from the collecting pixel 
into nearby pixels. We used the 3 x3 kernel specifi ed in 
the WFPC2 Instrument Handbook ijBiretta et all2002h . For 
this kernel, a fraction 0.365 of the collected electrons diffuse 
out of the central pixel, principally into the four nearest 
pixels. So the scattering function broadens the PSF signifi- 
cantly. Because of the cuspy galaxy light profile, the kernel 
is an essential element in the model fitting. However, the 



kernel was determined from pre-flight measurements, and it 
has not been remeasured in orbit. Therefore, somewhat sur- 
prisingly, it is a significant source of uncertainty (see below) . 

In the following we use X, Y for the coordinates on 
the CCD, and x,y for, respectively, the major and minor 
principal axes of any ellipse. The radius R is defined by 
R 2 — x 2 q + y 2 /q, where q = b/a is the ellipse axis ratio. 
The galaxy model parameters are the X, Y position of the 
centre, the axis ratio q, and the angle of orientation <j>, plus 
the parameters of the function defini ng the surface b right- 
ness profile. A de Vaucouleurs profile (|de Vaucouleurs|| l948j) 
did n ot provide a satisfactory fit, so a Sersic model jSersid 
1968), where the surface brightness as a function of R is 
S(R) = S 1/2 exp{-B(n)[(R/R 1/2 ) 1/n - 1]}, was tried. The 
parameter n quantifies the shape of the profile: the values 
n — 0.5, n — 1, and n = 4 correspond to the Gaussian, ex- 
ponential, and de Vaucouleurs profiles. Profiles with larger 
n are more sharply peaked. For cuspy galaxies, n > 3, spe- 
cial care is needed in integrating the Sersic function over 
sub-pixels near the centre. B(n) is a constant for particu- 
lar n, and we used the serie s asym ptotic solution for B(n) 
provided bv lCiotti fc Bertinl Jl999h . 

With the Sersic function we obtained fits that were glob- 
ally satisfactory, in terms of x? ■ Nevertheless the fit over 
the central 0.5 x 0.5" was bad. We also found that the fit- 
ted value of n depended sensitively on the size of the box, 
increasing as the box size diminished. This could be be- 
cause either the Sersic profile is not cuspy enough for this 
galaxy, or the pixel scattering kernel used is too broad. To 
investigate the latter we modified the kernel, reducing the 
scattered fraction from 0.365 to 0.2, but the central fit was 
still unsatisfactory. We therefore retained the standard ker- 
nel, and added a point source at the centre of the model 
surface brightness profile. The full model has 8 free param- 
eters; X, Y, q, <f>, Si/2, Ri/2, n , and p, the counts in the point 
source. Here it should be understood that R1/2 is the half- 
light radius of the Sersic component of the light profile, and 
not the radius containing half the total galaxy light. The 
final fit was made to a square-region, 131 x 131 drizzled pix- 
els, i.e. 6.55" x 6.55" in extent, centred on the galaxy, the 
entire region reproduced in Fig. The fit is satisfactory, 
both globally, and in the central region. 

The image after subtraction of the galaxy profile is 
shown in Fig. Q (RHS). We have designated the main com- 
ponents of the lensed image in a similar fashion to KT03, 
with the four main bright spots labelled A, B, C, & D clock- 
wise from the NW component. We add the labels E for the 
arc of extended emission between images B and C, and Al 
for the extended emission north of A. The increased Poisson 
noise visible toward the centre of the subtracted image is a 
consequence of the steep luminosity profile. Because of this, 
a demagnified central (fifth) image of the source would not 
be detectable with these data. 

The reduced x 2 of the Sersic + point source fit is 
\l = 0.96±0.02, whereas the expected value is 1.000±0.011 
for v — 16311. These two values are consistent, and both the 
fit and the estimated uncertainties are therefore deemed sat- 
isfactory. (Note that in this case the accuracy to which xl 
can be measured (2%, §2.1) is worse than the standard devi- 
ation of the Xv distribution for the given number of degrees 
of freedom.) 

The parameters of the fit are provided in Tabled Mag- 
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Figure 1. The figure shows a region 6.55" X 6.55" (131 X 131 pixels) centred on the lens galaxy of ER 0047-2808; LHS: the interlaced 
final WFPC2 image, centre: the mask used in the galaxy fit, RHS: after subtraction of the galaxy image (as described in the text). The 
four bright spots in the image are labelled A— D. The arc of extended emission between B & C is labelled E. The small arc of extended 
emission north-east of A is labelled Al 



nitudes are on the VEGAMAG system for t he F555W fil- 
ter, a nd use the zero point 22.538, for chip 4 l|Baggett et alJ 
2002). The orientation of chip 4 was taken from the im- 
age header. The PA of the Y axis of chip 4 was at 27.44° 
for these observations. To calculate the PA, hereafter 9q, 
this angle was added to <j>, the measured orientation of the 
galaxy image on the chip. 

For the best fit n, we have B(n) = 5.89. For this 
model, the galaxy total magnitude (Sersic + point source) 
is V555 = 20.61. KT03 obtained the same value for the to- 
tal magnitude, fitting a de Vaucouleurs profile, following 
a similar procedure to that described here, but which dif- 
fers in detail. Howe ver, in common with other authors (e.g. 
iBlanton et alj |200T). we caution that meaningful total mag- 
nitudes cannot be measured for early-type galaxies. This is 
made clear when we compare the best-fit Sersic only model, 
to the Sersic + point source fit. In terms of aperture mag- 
nitude the two profiles agree to within a few percent at all 
radii over the fitted region, except at the very centre. How- 
ever, the best-fit Sersic profile (which has n = 6.9) has very 
extended low-level wings beyond the fitted region, and a to- 
tal magnitude 40% brighter. This shows that the models are 
only useful over the region of the fit. For similar reasons it is 
not meaningful to compare half-light radii of fits of different 
models. 

The uncertainties quoted in the table are the random 
errors, computed by inverting the \ 2 curvature matrix. The 
actual uncertainties in some of the parameters are larger, 
and are dominated by the uncertainty of the pixel scatter- 
ing function. For example, for the sharper kernel considered 
above, where the scattered fraction is 0.2, the parameter n 
increases to 3.7, and the brightness of the point source re- 
duces to V555 = 24.3. Although the values of the parameters 
of the fits for the different kernels are quite different, again 
the profiles themselves are very similar over the region of 
the fit, except at the very centre. 

We find satisfactory agreement between the Sersic + 
point source profile and our previous fit of a d e Vaucouleurs 
profile to a ground based image of the galaxy dWarren et alJ 
Il996l) . The previous measured values of q and PA are con- 
sistent with the new values. The old fit gave an integrated 
brightness V = 21.12 within the measured effective radius 
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Table 1. Photometric parameters for the lens galaxy, measured 
assuming the standard WFPC2 pixel scattering kernel. 

R e = 1.39". For the Sersic + point source model fit to the 
HST image, within the same radius we find V555 = 21.17, in 
excellent agreement. 

To summarise, using an 8-parameter Sersic + point 
source model, convolved with the HST PSF, we obtained 
a good fit to the galaxy surface brightness profile, at all 
radii over the 6.55" x 6.55" box. The measured values of 
the parameters are somewhat sensitive to the adopted pixel 
scattering kernel. Nevertheless, over the fitted region, the 
deconvolved surface brightness profiles determined are not 
sensitive to this function. The galaxy model was subtracted, 
to leave the Einstein-ring image to be modelled. 

2.3 Absolute magnitude 

The relation between absolute magnitude in the B band, 
M B , and V555 is: 

M B = V555 - 51og 10 (d £ /10) - k 555 + (B- V 55 s)o - 9555, 

where cLl is the bolometric luminosity distance in pc, £555 is 
the k— correction for the V555 filter, (B — Vs5s)o is the galaxy 
colour in the restframe, and $555 is the Galactic extinction 
for this galaxy at the wavelength of the V555 filter. The com- 
bined term t = — fc555 + (-B — 1/555)0 depends on the spectrum 
of the galaxy (corrected for Galactic reddening). To deter- 
mine t, KT03 computed synthetic spectra that match the 
measured colour of the galaxy V555 — isi4 = 1-94 ± 0.12, 
finding t — —0.439 (properly the colour should be corrected 
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for Galactic reddening, but the correction is very small com- 
pared to the colour photometric error). Note that the colour 
of an old elliptical at this redshift would be V555— Isi4 ~ 2.4. 

We have performed simil ar calculations, using the BC03 
population synthesis code feruzual fc Chariot! 120031) . for 
galaxies of solar metallicity, and of specified formation red- 
shift Zf, with an exponentially declining star formation rate, 
of timescale r. Combinations Zf = 2, r = lGyr, and 2/ = 4, 
t — 1.5Gyr, reproduce the measured colour, and yield simi- 
lar values of t to that computed by KT03. Therefore we have 
adopted their value for t, as well as their value 3555 = 0.052. 
The final result is Mb = V555 - 42.834 + 51og/i 6 5, used to 
compute the value of Mb (tot) quoted in Table Q 

The BC03 model spectra provide an additional useful 
quantity, the restframe B— band luminosity evolution be- 
tween z — 0.485 and 0. The results for the two models are 
Am = 0.9mag. 



3 GRAVITATIONAL LENS INVERSION 

The HST image, Fig. shows four bright peaks. The lens- 
ing analysis by KT03 was limited to fitting the positions of 
the four peaks in the ring, on the assumption that these are 
all images of the same source. This is similar to the proce- 
dure followed in analysing images of gravitationally-lensed 
quasars, and is usually satisfactory for accurate measure- 
ment of the total mass within the ring. Nevertheless, grav- 
itational lenses like ER 0047-2808, for which the source is 
resolved, offer the prospect of measurement of the profile of 
the mass distribution in the lens, as well as the light pro- 
file of the source. Hitherto, the small number of radio Ein- 
stein rings have provided the only systems with deflectors 
of galaxy mass, possessing such an extended distribution of 
surface brightness. The detailed structure revealed by the 
WFPC2 imaging makes ER 0047-2808 the first system dis- 
covered at optical wavelengths for which a similar analysis 
is possible. As detailed below, we have discovered that the 
four peaks are in fact due to two distinct sources. 



3.1 Inversion algorithm 

Presented with an image such as Fig. the problem is to 
determine simultaneously the source light profile and the 
lens mass profile. For a review of the different published 
solutions to this p roblem, the reader is r eferred to the Intro- 
duction section in I Warren fc Dvel |2003). A non-parametric 
solution, where the source light profile is pixelised, is pre- 
ferred to a parametric solution, because the light profiles 
of high-redshift galaxies are typically complex and there is 
no clear way to choose an appropriate parameterisation. On 
the other hand the surface mass distribution in the elliptical 
galaxy lens is expected to be much simpler, and may be pa- 
rameterised in the first instance (i.e. until a poor fit indicates 
the model needs refining e.g. to incorporate additional mass 
or shear components). A proper solution must account for 
the smearing of the image by the PSF. Because of this de- 
convolution step, some form of regularisation will usually be 
required. The LensMEM algorithm described by WKN96 
satisfies all these requirements, and is the method we have 
used here. We follow closely the methods set out in WKN96 



with a few significant improvements. Here we provide only 
an outline of the method, and detail any differences. 

The goal of the process is to find a source and lens model 
which provide an acceptable fit to the data. The quality of 
the fit to the data is quantified by the \ 2 statistic. The 
merit function that is minimised is C = x 2 — ^S, where 5", 
the regularising term, is the entropy in the source plane. The 
term A is a weight which determines the relative importance 
of the x 2 an< A entropy terms to the solution. The adjustment 
of A is explained below. 

Using Si for the counts in source pixel i, the usual 
expression for entropy is S = — ^.Sjlnsj. We have 
used the the modified term S = —^2 i Si[\n(si/A) — 1] 
JSkilling fc Brvanllf984l) . With this expression A defines a 
default source pixel value. We found the solution to be in- 
sensitive to this parameter, for appropriately small values, 
below the noise level in the image. 

The WKN96 algorithm proceeds in three nested cycles. 
The innermost cycle determines the solution for the source 
light distribution for a fixed set of lens mass parameters, 
and for fixed A. The middle cycle adjusts A. The outer cycle 
adjusts the lens mass parameters and calculates the map- 
ping matrix. For a fixed mass model, a mapping matrix is 
generated by dividing each pixel in the image into two tri- 
angles which are projected back into the source plane. The 
mapping matrix records the fraction of each source pixel 
from which a given image pixel maps. Once computed, the 
mapping matrix has convenient properties which allow pro- 
jection between source and image plane (in either direction, 
correctly conserving surface brightness) and the calculation 
of critical lines and caustics. The model image is formed by 
projecting the source through the mapping matrix, followed 
by convolution with the PSF. If A is large, the minimum— C 
solution will have a smooth source, but the quality of the 
fit, as quantified by \ 2 , may be unsatisfactory. The \ 2 °f 
the fit may be improved by reducing A. The curve of x 2 (at 
the minimum— C solution) against A is monotonic. The goal 
is to have the smoothest solution for the source that gives a 
'satisfactory' fit, defined by a target value of \ 2 ■ The mid- 
dle cycle adjusts A searching for the target \ 2 point on the 
curve for a fixed mass model. 

WKN96 used a conjugate gradient technique with 
search direction VC = V\ 2 ~ AV5* where V^ 2 is the gradi- 
ent of the \ 2 projected back into the source plane. For fixed 
A, each iteration in the inner cycle applies a scaled VC to the 
current source model such that C is minimised. The inner 
cycle stops when no more improvement can be made. There 
are two problems with WKN96's implementation. Firstly, 
the magnitude of V5* can be very large for individual pixels 
which happen to be small positive numbers due to the gra- 
dient of the entropy dS/dsi — In A — In Sj. Secondly, adding 
the m iddle cycle, to adjust A, is inefficient. ISkilling fc Brvanl 
( 1984) identified these problems (and more) and created an 
elegant algorithm to find solutions which incorporate adjust- 
ing A as part of the inner cycle. In this way there are only 
two cycles. For a fixed mass model, the algorithm adjusts A 
and the source light profile at the same time, to achieve the 
maximum-entropy solution subject to the constraint that 
X 2 equals the target value. The mass model is then adjusted 
to give the maximum of these maximum entropy solutions. 
The method achieves the same result as before, but views 
the problem differently. The term A is now seen as a La- 
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grange multiplier in a constrained minimisation problem, 
rather than as a weight on the regularisation term . We have 
used the algorithm from lSkilling fc Brvanl l|l984h in our im- 
plementation, and have found it superior to the conjugate 
gradient technique used in WKN96. 



3.2 Mass Models 

A primary goal of the study is to achieve some discrimi- 
nation between the fairly extensive range of mass/potential 
models that have been found to reproduce image configura- 
tions in gravitationally lens ed systems that possess fewer ob- 
servational constraints (e.g. lKochanekll995l) . We have tested 
six simple elliptical models. The definitions of the models are 
provided in Table The models are defined in terms of the 
radial profile of either the potential, ip, the dimensionless 
surface mass density, n, or the volume density, p. We fixed 
the centre of each model to be the centre of the light distribu- 
tion. The six mod els are: 1. the pseudo-isotherm al elliptical 
potential (PIEP) felandford fc Kochaneklll987l) . 2. the sin- 
gular isothermal sphere in an external sh ear field (SIS+7), 3. 
the s i ngular isotherma l ellip soid (SIE) llKassiola fc Kovnerl 
Il993l: iKormann et all Il994l) . 4. the power-law ellipsoid 
rSPEMDl <|Barkanal Il998h Cof which the SIE is a special 
case), 5. the constant mass-to-light ratio (M/L) model, de- 
fined by the Sersic+point-source profile fit described i n Sec- 
tion E3 and 6. the NFW model (iNavarro et"al"]ll99rl here- 
after NFW96). For each model the deviation from circular 
symmetry is parameterised by the axis ratio q = b/a or the 
external shear 7 (for the SIS+7). The profiles are referred to 
the ellipse coordinate R = (x 2 q+y 2 /q) 1 ^ 2 , where x and y are 
the major and minor principal axes with PA 9o . The mass 
scale b is normalised in angular units to the usual definition 
for the Einstein radius of a SIS: b = 4-k((t' 2 /c 2 )(D ds /D s ). 
Note that for the SPEMD, 6 is the Einstein radius only 
when /3 = 1, otherwise it should be viewed as a dimension- 
less scale factor. The Constant M/L model mass scale has 
units of Ecrit, while the NFW model has units of T, cr it/r 2 
where E cH t = (c 2 /4itG)(D s / D d D ds ) = imQh^ M Q pc~ 2 is 
the critical density of the lens. 

For the NFW parameterisation we fitted two mod- 
els, with different values of the scale radius r 3 . The mea- 
sured stellar central velocity dispersion for the lens is 
229 ± 15 km s " 1 (KT03). By comp arison with the n-body 
simulations of lBullock et alJ (2001), for the galaxy redshift 
z = 0.485, the corresponding virial mass is ~ 10 13 Mq, 
and an appropriate value of the scale length is r s ~ 50kpc. 
However the effect of the baryons would be to steepen the 
mass profile, and a smaller scale length would be appropri- 
ate. The scale lengths we chose were 25 kpc (model 6a) and 
5 kpc (model 6b) . The model with the smaller scale radius 
would be expected to have similar lensing characteristics to 
the SIE model, since the NFW density profile is approxi- 
mately oc r~ 2 around the scale radius. 

Deflection angles can be simply computed from the po- 
tentials for the PIEP and SIS+7 models, while the calcu- 
lation is_jmalvtic_ibr the SIE model. We use the FASTELL 
code fBarkanaj |l99g|) to compute deflection angles for the 
SPEMD model. The deflection angle for the constant M/L 
model must be com puted numerica lly, for which we followed 
the prescription of iKeetonl (2001). For the NFW model 



we us ed the approximation introduced by Golsc & Kneib 
(2002) to compute deflection angles. 

3.3 Source— plane pixelisation 

We chose a square source plane, offset from the centre of 
the galaxy. The size and position of the source plane were 
finalised after preliminary modelling provided an indication 
of the approximate location and extent of the source. The 
source plane should be as small as reasonably possible. If the 
source plane is made too large, most of the pixels will map 
to sky in the image plane, and this reduces the power of the 
X 2 statistic to discriminate between different mass models. 
We used a 10 x 10 grid of pixels. The source pixel size was 
chosen such that the size of the caustic (in pixels) for differ- 
ent mass models was approximately the same. The pixel size 
was chosen as a reasonable match to critical sampling of the 
image-plane resolution element transformed to the source 
plane. This criterion ensures that the full information con- 
tent of the image is used in the source reconstruction, while 
minimising the degree of regula risation required [the se is- 
sues are discus s ed mo re fully in lWarren fc Dvel i2003T) and 
iDve fc Warrenl (120041) ]. The source pixel size for all models 
was within the range 0.05"/L5 and 0.05" /3. 

Mapping the entire source plane to the image plane de- 
fines the minimum extent of the region over which x 2 should 
be measured. During the minimisation process, as the mass 
parameters vary, the size of this region varies (since the map- 
ping depends on the mass parameters). This can mean that 
the surface of the merit function is not smooth, since for 
small changes in a mass parameter the number of image 
pixels varies, as a pixel is included or excluded. For this rea- 
son we defined a fixed region, or 'mask', in the image plane 
of 822 pixels over which \ 2 is measured. The mask is chosen 
to be slightly larger than the image of the source plane in 
order to allow for size variations in the projected image of 
the source plane during minimisation. 

3.4 Discriminating between models 

Each model has 3 (or 4 for the SPEMD, 1 for the Const 
M/L) parameters for the mass model plus 100 for the source 
plane pixels. With the fixed 822 pixel mask, there are then 
719 degrees of freedom in the model and the acceptable 
range of \ 2 is 719 ± 38. 

Models which do not produce a satisfactory fit in terms 
of x 2 are ruled out. However a more stringent comparison 
is possible with the F-test. The principle of the F-test is as 
follows; for a model that is linear in the parameters, the sig- 
nificance of the improvement in \ 2 achieved by the addition 
of N extra parameters may be assessed by comparison of 
Ax 2 against the \ 2 distribution for N degrees of freedom. 
This must be applied here with a degree of caution, since we 
are not adding parameters to linear models, but comparing 
different non-linear models. 

There is an additional complication associated with the 
application of regularisation in the inversion via the entropy 
constraint. Regularisation smooths the source. This acts to 
reduce the effective number of parameters in the fit, and 
increases the effective number of degrees of freedom, by an 
amount which increases with the degree of regularisation. 
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Definition 
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Free Parameters 



PIEP 
SIS+7 
SIE 

SPEMD 
Constant M/L 
6a. NFW 
6b. NFW 



V>(R) = bR 

ip(R, 0) = bR+ ^R? cos 2(8 - O ) 
-l 



k(R) 
k(R) 
k(R) 
p(r) = 
p(r) = 



= (b/2)R~ 
-- (b/2)R~P 
-- Kl/ 2 S(R) 

£c 

(r/r 3 )(l + r/r a )' 2 

£c 

(r/r a )(l + r/r a y 2 



b,q,9 
Mo>7 (<? 
b,q, O 
b,q,e ,P 

K l/2 

p c ,q,8 () ,r t 
p c ,q,8 ,r t 



1) 



: 25 kpc 
: 5 kpc 



Table 2. Summary of lens models used in this paper. The models are defined by either their 2-dimensional lens potential, t/>(-f?), their 
dimensionless surface mass density, k(R), or their volume density, p(r). The radial coordinate is defined as r = [x 2 (l — e) + y 2 (1 + e)] 1 / 2 
or R 
b. 



(x 2 q + y 2 /q) 1 / 2 . S(R) is the Sersic function described in i|2,2I Wc characterise the isothermal-like models by their Einstein radius 



Unfortunately, there is no way to quantify this increase of 
degrees of freedom. However, the entropy constraint is sim- 
ilar to a 'zeroth-order' regularisation constraint in that it is 
the sum of contributions from individual pixels and has no 
inter-pixel dependence. Likewise the gradient of the entropy 
dS/dsi depends only on an individual pixel's value. Hence, 
the covariance between pixels is generated only through 
the overall contribution of \S to C. The use of a small 
source also helps to minimise the (unknown) increase by 
only using pixels which are required by the data. In addi- 
tion, rather than follow the usual procedure of searching for 
the smoothest source consistent with the target \ 2 (which 
one would do if particularly interested in the nature of the 
source when the mass model is known) , we instead seek the 
best fit achievable for each mass model. We implement this 
simply by setting the target \ 2 to be 680 for all models and 
note that no mass models were able to reach this lower limit 
on the range of acceptable x 2 ■ Hence, the procedure is effec- 
tively a non-negative min— \ 2 solution. This procedure again 
minimises the increase of degrees of freedom. (For these so- 
lutions discussed in Jl| the entropy term in the merit func- 
tion, \S, is a factor at least 70 times smaller than the x 2 
term.) Furthermore, our scaling of the size of the source re- 
gion to the size of the caustic for each model, ensures that 
each model is treated in a comparable way. In this way any 
(small) increase in the number of degrees of freedom will be 
quite similar for each model, and therefore will have mini- 
mal effect when comparing models. Accordingly we ignore 
this uncertainty, but demand a high level of significance in 
ruling out models. 



3.5 Errors on parameters 

If regularisation is applied, the only correct way to measure 
the errors is through Monte Carlo methods, applying ran- 
dom realisations of noise to the data, and measuring the 
spread in the parameters. However the degree of regulari- 
sation applied here is minimal because the fit amounts to a 
non-negative min— \ 2 solution, in which case the errors may 
be estimated by inverting the Hessian matrix for the x 2 sur- 
face at the minimum. We verified that the Hessian matrix 
and Monte Carlo methods provided very similar answers for 
one model, and then used the Hessian matrix for all other 
models. 



4 RESULTS 

4.1 The lens galaxy 

The results of the modelling procedure are provided in Ta- 
bic []] and illustrated in Fig. |2] Table |H] lists in successive 
columns: 1. the name of the lens model, 2. the number of 
degrees of freedom and la uncertainty, 3. the x 2 of the best 
fit, 4. the ratio of the image pixel size to the source pixel 
size, 5. the area of the caustic, and 6. the relevant best-fit 
parameters and their errors. 

Notwithstanding the above caveats on the comparison 
of models, the interpretation of these results is relatively 
straightforward. Two models, the SIS+7 and NFW(25kpc) 
models, are strongly rejected on the basis of \ 2 alone. Four 
of the models, the PIEP, SIE, SPEMD, and NFW(5kpc) 
models, provide acceptable fits, with very similar values of 
X 2 ■ This is not too surprising, since these four models all 
have very similar mass profiles over the radii of interest. 
The axis ratio qpiEP can be calculated for equivalent isoden- 
sity contours of the PIEP model (for small elli pticities) us- 
ing equation 3.3.2 of Ka ssiola fc Kovnerl (119931) . This yields 
q PIE p = 0.77 in excellent agreement with the SIE/SPEMD. 
This confirms the expected result that the models are equiv- 
alent for large q. Finally the constant M/L model lies be- 
tween the two extremes, producing a x 2 which is rejected 
with 98% confidence. 

Applying the F-test to our models, extends these re- 
sults. For example comparing the SIE model and the con- 
stant M/L model, we have Ax 2 = 86 for a change in num- 
ber of degrees of freedom of only 2. The significance of this 
is so great that we may safely rule out the constant M/L 
model. However the most interesting case is comparison of 
the SPEMD and SIE models. Here the only change is be- 
tween fixing the power law exponent to the value (3 = 1, and 
allowing it to be a free parameter. The best fit value for the 
SPEMD model is (3 = 1.08 ±0.03 i.e. a small but significant 
change. The change Ax 2 = 4.7 for AiV = 1 is significant at 
the 97% level. 

Thus, the overall mass distribution in the lens galaxy 
favours a mass power law which is slightly steeper than 
isothermal around the Einstein radius. This is important 
because Ho measurements based on lensing time delays are 
sensitive to the mass power law (E tx i? - * 3 ) at the radius 
of the images, with larger values of (3 (i.e. more centrally 
concentrated mass) leading to larger actual time delays. 
(This would then be interpreted as smaller value of Ho us- 
ing an isothermal model). Most importantly, we found no 
strong degeneracy between mass power law (/3), mass scale 
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Figure 2. Data and best fitting images for each of the lens models arc shown in the upper panels. The residual between the data and 
model is shown in the centre panels. The corresponding sources, together with the model caustics, are shown in the lower panels. 
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(So) and axis ratio (q) for the SPEMD model- a problem 
frequently encountered when m odelling quasar lenses (e.g. 
IWambsganss fc Paczvnsklll994h . 

The mass enclosed within the critical line is 3.05ft^" s x 
10 11 Mq for all successful mass models. Using the photomet- 
ric properties of the lens galaxy from Table we calculate 
the total projected M/Lb inside the Einstein radius to be 
4.7/i65 ±0.3 Mq/Lbq- This value is smaller than that calcu- 
lated by KT03 (5.4 ±0.8), however KT03 used a larger Ein- 
stein radius (1.34") and the presence of dark matter would 
make the total M/Lb increase with radius. We also note 
that the constant M/L model, if correct, would require the 
stellar M*/Lb to be 6.8/i65- 

Finally, we can infer some qualitative properties of the 
galaxy's dark halo in the region of the Einstein radius. The 
position angle of the mass distribution is almost identical 
for all successful models and is within a few degrees of the 
measured position angle for the lens galaxy. The axis ratio 
of the overall mass profile (q=0.77) is rounder than the axis 
ratio of the visible galaxy (q=0.69). This indicates that the 
halo must be substantially rounder than the visible galaxy. 



4.2 The source 

The inferred morphology of the (unlensed) source, Fig.|5] is 
particularly interesting. For all the models which provided 
satisfactory fits the reconstructed source is double, with the 
smaller component lying inside a fold caustic, and the larger, 
equally bright, component lying on top of the same fold 
caustic. This vindicates the use of a non-parametric model 
for the source. It would be very difficult to determine this 
solution for the source with a parametric model. 

Guided by these results, we created a simple two- 
component source model, lensing each component sepa- 
rately, and together, through the PIEP lens model, to create 
the images shown in Fig. |31 The lower left panel shows the 
image of the inner source component, modelled as a single 
bright pixel. The middle lower panel is the image of the 
outer source component, modelled as an elliptical Gaussian. 
The combination image is shown in the lower right panel. 
Comparing against Fig. this model shows that features 
B, C and the brighter spot in Al are unresolved images of a 
single source. Features E, A and most of Al are the image 
of the extended object which crosses the caustic. Feature D 
is the combined, barely resolved, image of both components. 
In their modelling KT03 assumed that the images A, B, C, 
D are the four images of a single source component. However 
they were unable to obtain a good fit with their model. This 
further illustrates the importanc e of using a suitably de tailed 
source model, also illustrated in I Warren fc §003) 

In contrast with the models which provide good fits, 
the source reconstructions for the SIS+7, NFW(25kpc), and 
constant M/L models consist of several fainter blobs or are 
smeared out and less well defined. There is no a priori reason 
to believe that the source morphology is simple. However, 
the complex source geometries evident for the mass models 
that produce poor fits likely arise from the code's attempt to 
compensate for the poor fits by introducing complex struc- 
ture in the source plane. 

Using the reconstructed PIEP source model, the source 
flux was calculated to be 6.5 x 10~ 17 erg s _1 cm -2 . The 
overall magnification of the source is 20, hence this is 
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Figure 3. The two components used as the model source (top 
row) and the corresponding images (bottom row). The linear size 
of the source plane shown here is 8 times its true size relative to 
the image plane. 



roughly twice the v alue expected from the Lya flux alone 
J Warren et alj fl996). The discrepancy is probably due to 
continuum around the emission line which cannot be mea- 
sured with these observations. Hence, in the following we 
assume exactly 1/2 of the source flux is contained in the 
emission line. The source emission line luminosity Ll V c is 
then 4.5 x 10 42 erg s" 1 /i^ 5 2 . Assuming the Lya line is not 
generated by AGN activity, we estimate the number of O/B 
stars 

LLyct 



N s tar 



ELyaQo — 



"St 



(1) 



required to produce the observed luminosity where EL ya 
is the energy of a Lya photon, Qo is the ionising; photo n 
luminosity of a single O/B star from fVacca et all il996t) . 



OLHf, x (jLy a /jH )(E L y a /E H0 ) and the ratio 



"Lyo 



Lya — "-"il " (J^ya/Jtip/y-^Lyc/^ilp) 

is calc ulated to be 8.2 from tables 2.1 and 4.1 in lOsterbrockl 

i4t 



( 1989) assuming a temperature of 10 K, although the ratio 
depends only weakly on temperature. Qo differs substan- 
tially depending on stellar type, ranging from 10 49 53 for an 
05 V star to 10 47 ' 9 for a B0. 5 V star. Using these values, 
we estimate N s t ar s to be between 10 3 (all 05 V stars) and 
4 x 10 4 (all BO. 5 V stars). 



5 DISCUSSION 

KT03 have undertaken an independent study of the 
ER 0047-2808 system and before discussing the implications 
of our results we consider the main results of the two studies. 
KT03 did not find acceptable fits with an SIE+7 model and 
a single component source. Given that they used the lensing 
data only to measure the total mass interior to the images, 
KT03 did not pursue more complex models. The model of 
KT03 required significant external shear and a larger Ein- 
stein radius, and consequently their critical curve was larger 
and flatter and a larger mass was enclosed within it. How- 
ever, the mass enclosed within a circular aperture of radius 
1.17" is 3.05/ieg 1 x 10 11 M Q for the KT03 model, which agrees 
well with our models. This result implies that the power- 
law slope of the mass density published in KT03 (7') can 
be revised upwards by 0.1 at the radius we are considering, 
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Lens 
Model 



Model 



Image/Source 
pixel size ratio 



Area of 
caustic (arcsec 2 ) 



Best fitting 
parameters 



1. PIEP 



2. SIS+7 



3. SIE 



4. SPEMD 



719±38 713.0 



719±38 876.5 



719±38 712.0 



718±38 707.3 



5. Constant M/L 721±38 799.2 1.5 
6a. NFW(25 kpc ) 719±38 863.1 3 



6b. NFW(5kpc) 719±38 721.2 



0.060 



0.050 



0.060 



0.060 



0.084 
0.020 



0.060 



b=1.170" ± 0.004 
q = 0.917 ± 0.004 
9 = 66.2° ± 0.6 
b=1.166" 
7 = 0.077 
8 = 60.4° 
b = 1.173" ± 0.003 
q = 0.775 ± 0.005 
9 = 65.9° ± 0.5 
b = 1.151" ± 0.007 
q = 0.76 ± 0.006 
9 = 65.9° ± 0.9 
P = 1.08 ±0.03 
k 1/2 = 0.453 
p c = 0.3225 
q = 0.95 
9 = 64.4 
p c = 1.133 ±0.003 
q = 0.918 ± 0.003 
6»n = 66.4 ± 0.8 



Table 3. Results of modelling with six mass models. Models which provide an acceptable fit are shown with la errors. Models which 
do not provide an acceptable fit are shown with the 'best fit' parameters only. The scale factor b for the SPEMD is the same as the lens 
Einstein radius only when (3 = 1. 



leading to the best estimate of the slope to be the isother- 
mal value 7' = 2.0 (L. Koopmans, private communication). 
A simple 1-dimensional calculation of the expected velocity 
dispersion for a critical radius of 1.17"yields a = 234 km s -1 
also in good agreement with the measurement of the cor- 
rected central velocity dispersion of 229 ± 15 kms -1 from 
KT03. 

Our investigation has identified four mass models 
[PIEP, SIE, SPEMD and NFW(5kpc)] which can reproduce 
the data. While the NFW(5kpc) model was successful, the 
inability of a NFW profile with a more realistic scale length 
(25 kpc ) to reproduce the properties of the ER 0047-2808 
system is almost certainly due to the neglect of the bary- 
onic component in the deflector galaxy. The baryonic com- 
ponent is expected to contribute significantly to the total 
surface mass density within the Einstein radius and the suc- 
cess of the NFW(5 kpc ) model is due to the ability of the 
smaller, much higher density, core to mimic the effect of a 
significant baryonic contribution at the galaxy centre. It is 
quite conceivable that inclusion of the baryonic component, 
and allowance for it's eff ects on the dark-matter profile (e.g. 
iTreu fc Koopmansl f2002). could reconcile the data with the 
NFW profiles predicted by simulations. 

Our modelling has shown that the data alone are able 
to tightly constrain the slope of the total mass profile of the 
galaxy. While the elliptical isothermal-like models are all 
consistent with the data, our models favour a mass profile 
which is slightly steeper than isothermal (E tx ^ _1 ' 08±0 ' 03 ) 
around the Einstein radius. The difference in slope corre- 
sponds to a ~ 8% systematic reduction in the value of Ho 
which would be inferred from a time-delay measurement if 
a pure isothermal model were used. 

Three mass models [SIS+7, Constant M/L and 
NFW(25 kpc )] are not capable of reproducing the data. To 
understand the potential discriminating power of the data 
we have plotted contours of equal magnification over the 



data for the successful models (Fig. 2J, and unsuccessful 
models (Fig. The diagrams show the magnification in 
each region of the image. The critical lines (not shown) lie 
between the closely spaced contours where the magnification 
is equal to a factor of 20. 

Fig. [I] shows the subtle differences between the suc- 
cessful models. The SIE model is indistinguishable from the 
PIEP model at this image resolution and is not shown. The 
SPEMD model has a slightly more elliptical critical line com- 
pared to the PIEP. In contrast, the NFW(5kpc) model has 
a slightly rounder tangential critical line and some interest- 
ing properties near the centre of the lens, with the presence 
of a radial critical line before the highly demagnified centre 
of the lens. In spite of this property, the NFW(5kpc) lens 
model does not produce any images near the centre of the 
lens (given our source position). It is also apparent from Fig. 
0] that the magnification of each of the images is virtually 
the same for the successful models. Images A, B & C are 
the most highly magnified with a magnification factor ~ 10, 
while image D has a lower magnification of ~ 4. Both the 
similarity in the shape of the magnification contours and of 
the image magnifications themselves suggest that lens mod- 
els which have elliptical isothermal-like mass distributions 
on the scale of the Einstein radius will be able to fit the data 
(in terms of pure x 2 ). 

Fig. shows that the magnification contours and criti- 
cal line shape for the unsuccessful models differ significantly 
from the cases of the PIEP and SIE models. Close exami- 
nation of the model images show they incorrectly reproduce 
the brightness and/or location of the bright regions in the 
data. The SIS±7 model produces rounder contours and im- 
ages B & C are closer to the critical line. This explains why 
the model does not reproduce the smaller component of the 
source well, with the consequence that images Al, B & C in 
the model image are fainter than the data and image C is 
also incorrectly located. The constant M/L model has mag- 
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Figure 4. Contours of equal magnification for the successful lens models plotted over the data. From left to right: the PIEP, SPEMD 
and NFW(5kpc) models. Contours show the regions of the image where the magnification is 2, 5 and 20. The critical line for the lens 
model is between the two contours for magnification 20 in each case. The successful models have remarkably similar critical curves and 
magnifications around the locations of the images. 






Figure 5. Contours of equal magnification for the unsuccessful lens models plotted over the data. From left to right: the SIS+7, constant 
M/L and NFW(25 kpc ) models. Contours show the regions of the image where the magnification is 2, 5 and 20. The outer magnification 2 
contour for the NFW(25kpc) is outside the image region. The critical line for the lens model is between the two contours for magnification 
of 20 in each case. 



nification contours which are more elliptical than for the 
PIEP and SIE models. All of the images are further from 
the critical line and image D in particular is in a region of 
low (~ 2) magnification, resulting in an image which is the 
wrong shape (image B is incorrectly located) and consid- 
erably fainter than in the data. The NFW(25 kpc ) model 
has very unusual magnification contours and all of the im- 
ages are in a high (> 15) magnification region. This is a 
consequence of the small area of the caustic in the model, 
which causes the images to be smeared over several pixels; 
images B & D are too large while image D is stretched ra- 
dially more than in the data. Image C is not reproduced as 
a bright region at all. 

Thus, the data are sensitive to both the shape (elliptical 
vs quadrupole) of the mass distribution and the radial mass 
density profile. To compare the properties of each model, n 
and j3 (the local logarithmic slope of the surface mass dis- 
tribution) at each of the image locations (A,B,C,D) were 



calculated for each lens model. The results are shown in Ta- 
ble 0] The table shows that: 1) the properties of the PIEP, 
SIE and SPEMD are all very similar as expected, and 2) the 
particular value of k at the image locations is irrelevant, also 
as expected. The NFW(5kpc ) model has a local slope which 
is steeper than the PIEP/SIE and SPEMD models. Hence 
it has generated a poorer x 2 compared to these models (al- 
though still formally acceptable). This is consistent with the 
data having the ability to distinguish between models based 
partially upon the local slope of the mass distribution. 

This power in the data to distinguish between models 
raises the question: if the local slope and ellipticity of the 
mass distribution are important, what exactly is 'local'? In 
the context of ER 0047-2808, could we (for example) take 
the mass inside 0.5" in the SIE model and replace it with an 
equivalent point mass with no change in the lensing prop- 
erties? To quantify this 'local' p roperty, the ellip tical con- 
stant density slab mass models of lSchramml lll994l) are used. 
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Model 


K A 


KB 


K C 




Pa 


Pb 


Pa 


Pd 


PIEP 


0.43 


0.57 


0.44 


0.66 


1.01 


1.00 


1.00 


1.00 


SIE 


0.43 


0.57 


0.44 


0.67 


1 


1 


1 


1 


SPEMD 


0.41 


0.56 


0.42 


0.67 


1.08 


1.08 


1.08 


1.08 


NFW(5 kpc ) 


0.45 


0.49 


0.49 


0.60 


1.34 


1.36 


1.31 


1.30 


NFW25 


0.68 


0.73 


0.72 


0.81 


0.74 


0.72 


0.72 


0.67 


Constant M/L 


0.21 


0.43 


0.21 


0.59 


2.19 


1.94 


2.18 


1.85 


SIS + 7 


0.45 


0.50 


0.49 


0.58 


1 


1 


1 


1 



Table 4. Comparison of k and the local logarithmic slope (S <x R ") at each image position. The values for images A and D are 
somewhat uncertain because the images are extended. 



Consider the mass distribution in the lens as a stack of N 
constant density elliptical slabs with some elliptical radius 
r e (i) [r e (i — 1) < r e (i) for a monotonically decreasing surface 
mass density]. Each slab has well defined lensing properties 
and the deflection angle at any point is simply the sum of 
contributions from all slabs. For a point inside a slab, the 
deflection is simply that of a homeoidal ellipse. For a point 
outside the slab, the deflection is equivalent to a point in- 
side a different slab with foci which are at the same point 
as the original slab, a 'confocal ellipse'. If a mass distri- 
bution is to be distinguishable based on ellipticity and/or 
radial mass distribution, then the contribution from a slab 
must not be eq uivalent to some other slab or point mass. 
ISchramml 1 19941) gives us the answer for ER 0047-2808. We 
define the 'effective ellipticity' (e e ) at a point outside an 
elliptical slab as the ellipticity of an equivalent confocal el- 
liptical slab. It is easy to show that e e oc (r e /r) 2 for r > r e . 
That is, the effective ellipticity decreases rapidly for points 
increasingly further away from the outside of the slab. For 
r 3> r e , the slab looks like a point mass, as expected. Hence, 
for the models presented here where the error on elliptic- 
ity (axis ratio) is ~ 10%, the mass inside ~ 0.03" can be 
replaced with a point mass with no significant change in 
the lensing properties around the images. In a similar way, 
slabs with progressively larger radii (larger than the image 
radius) have progressively lower surface density (« oc r -1 for 
an SIE) , hence their contribution to the overall ellipticity is 
tx r^ 1 . Again, with an error of ~ 10% on the measured ellip- 
ticity, slabs with a radius > 10" are indistinguishable from a 
single mass sheet. Using this simple argument, we have de- 
fined what 'local' means for ER 0047-2808. The implication 
is that this lens cannot provide information about the mass 
distribution in the very centre of the galaxy or in the halo 
very far from the galaxy but there is a substantial range in 
between which does contribute to the image configuration. 

A final noteworthy property of ER 0047-2808 is the lack 
of necessity of any external shear for a successful lens model. 
In general, there is likely to be a shear contribution due to 
the environment. However, the lens is an isolated field ellip - 
tical with nearest neighbour ~ 1' away iWarren et alJl999t) . 
The shear generated by this object will be small (7 ~ 0.02) 
so it is not surprising that no shear was needed to model 
this system. The large shear required in the model of KT03 
was a consequence of having an oversimplified source. We 
emphasise the importance of using a suitable source model 
for resolved lenses before conclusions are made about the 
lens mass distribution. 

The combination of this work with that of KT03 has 
strongly constrained the mass distribution in the lens galaxy, 
showing it to be very close to isothermal (at least in the 



region of the images). Well constrained lens models are re- 
quired for Ho measurements using lensing time delays, thus 
ER 0047-2808 is an excellent candidate for monitoring of 
supernova events in the source. The lack of external shear 
also reveals a good de al about the overall m ass d istribution 
in the lensing galaxy. iKeeton et al] 1^)98) and iKochanekl 
(2002) have studied the alignment between visible matter 
and total matter distributions in lens galaxies and found 
they have an RMS misalignment < 10°. Our result adds 
weight to their conclusions. 



6 CONCLUSIONS 

We have performed a detailed analysis of ER 0047-2808 us- 
ing data from WFPC2 on board HST. The data show that 
the lensed image has four distinct bright regions and two 
regions of extended emission. We have modelled the image 
with six different lens models using sophisticated software 
based on the LensMEM algorithm. The software generates 
a best-fitting image and reconstructs the source brightness 
profile using a non-parametric source model. 

We have tested the ability of the data to distinguish be- 
tween a variety of lens models including isothermal, power- 
law, constant M/L and NFW mass distributions. We find 
that the 'canonical' SIE and PIEP lens models fit the 
data well and that a power-law model (E oc J? - ' 3 ) favours 
P = 1.08±0.03, slightly steeper than isothermal. In addition 
we find that we can fit the data with a mass profile based on 
the NFW profile, but only when the scale length is too small 
and central density too high for a dark matter halo expected 
in a large elliptical galaxy. Conversely, we find that a mass 
model based on constant M/L or a realistically-sized NFW 
halo cannot fit the data. We also find that the simple SIS+7 
lens model does not fit the data despite being an isothermal 
model. The data are good enough to distinguish between 
elliptical lens models and the SIS+7 which is a first-order 
approximation of an elliptical model. The difference appears 
to be in the shape of the critical curve between the ellipti- 
cal models and external shear model. We have discovered 
also that the source is actually a double. This has caused 
problems for models which assume the bright regions in the 
image come from a single source. 

Our results are somewhat different to those of KT03 
who found a larger Einstein Radius by 15%. However, their 
model assumed a single source and included a substan- 
tial shear component. The mass enclosed by KT03's model 
within a 1.17" aperture is consistent with our model. Our 
models show that the image locations and brightnesses can 
be explained neatly with the simple SIE/PIEP mass models 
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and the double source. We calculate the total M/Lb, inside 
1.17", to be 4.7/i65 ± 0.3 compared to 5.4 ± 0.8, at a larger 
radius of 1.34", in KT03. 

We have qualitatively evaluated the behaviour of the 
lens models around the location of the lensed images by plot- 
ting contours of constant magnification. We found that the 
SIE and PIEP models are indistinguishable for these data 
and that any elliptical mass distribution with an isothermal- 
like density profile around the images is likely to be able 
to fit the data. We found that the lens models which could 
not fit the data produce images with incorrect magnification 
and/or location. 

Finally, we note that the lens model requires no exter- 
nal shear to explain the data. The well constrained mass 
distribution, lack of external shear and isolation of the lens 
galaxy all suggest that ER 0047-2808 is an ideal candidate 
for further work such as monitoring for supernova events in 
the source, or making a detailed study of the properties of 
the galaxy's dark halo. 
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